Performance Improvement of Data Mining in Weka through GPU Acceleration

نویسندگان

  • Tiago Augusto Engel
  • Andrea Schwertner Charão
  • Manuele Kirsch-Pinheiro
  • Luiz Angelo Steffenel
چکیده

Data mining tools may be computationally demanding, so there is an increasing interest on parallel computing strategies to improve their performance. The popularization of Graphics Processing Units (GPUs) increased the computing power of current desktop computers, but desktop-based data mining tools do not usually take full advantage of these architectures. This paper exploits an approach to improve the performance of Weka, a popular data mining tool, through parallelization on GPU-accelerated machines. From the profiling of Weka object-oriented code, we chose to parallelize a matrix multiplication method using state-of-the-art tools. The implementation was merged into Weka so that we could analyze the impact of parallel execution on its performance. The results show a significant speedup on the target parallel architectures, compared to the original, sequential Weka code. c © 2014 The Authors. Published by Elsevier B.V. Selection and peer-review under responsibility of Elhadi M. Shakshuki.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Guest editorial: new developments in future networked systems

This special issue is based on the best papers from the 5th International Conference on Emerging Ubiquitous Systems and Pervasive Networks (EUSPN-2014), which was held in Halifax, Canada, on 22–25 September 2014. The conference attracted a large number of scientific papers that contributed to the state-of-the-art in the ground-breaking invention to future communication technologies, including m...

متن کامل

An Analysis of Urban and Rural Students Programming Skills Performance Using Clustering Techniques

37 Abstract— Educational Data mining is a recent trend where data mining methods are experimented for the improvement of student performance in academics. The work describes the mining of higher education urban and rural students’ related attributes such as improvement, behavior, attitude and relationship. The data were collected from Dr.Ambedkar Government Arts College in terms of the mentione...

متن کامل

Application of Kansei engineering and data mining in the Thai ceramic manufacturing

Ceramic is one of the highly competitive products in Thailand. Many Thai ceramic companies are attempting to know the customer needs and perceptions for making favorite products. To know customer needs is the target of designers and to develop a product that must satisfy customers. This research is applied Kansei Engineering (KE) and Data Mining (DM) into the customer driven product design proc...

متن کامل

Developing innovative applications in agriculture using data mining

The WEKA (Waikato Environment for Knowledge Analysis) system provides a comprehensive suite of facilities for applying data mining techniques to large data sets. This paper discusses a process model for analyzing data, and describes the support that WEKA provides for this model. The domain model ‘learned’ by the data mining algorithm can then be readily incorporated into a software application....

متن کامل

High Performance Implementation of Fuzzy C-Means and Watershed Algorithms for MRI Segmentation

Image segmentation is one of the most common steps in digital image processing. The area many image segmentation algorithms (e.g., thresholding, edge detection, and region growing) employed for classifying a digital image into different segments. In this connection, finding a suitable algorithm for medical image segmentation is a challenging task due to mainly the noise, low contrast, and steep...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014